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1 Set-oriented production rules in relational database systems ioo% 

vft Jennifer Widom , S. J. Finkelstein 

— 1 ACM SIGMOD Record , Proceedings of the 1990 ACM SIGMOD international 
conference on Management of data May 1990 
Volume 19 Issue 2 

We propose incorporating a production rules facility into a relational database 
system. Such a facility allows definition of database operations that are automatically 
executed whenever certain conditions are met. In keeping with the set-oriented 
approach of relational data manipulation languages, our production rules are also 
set-oriented— they are triggered by sets of changes to the database and may 
perform sets of changes. The condition and action parts of our production rules may 
r ... 



2 Pipelined data parallel algorithms— concept and modeling ioo% 

U C.-T. King , W.-H. Chou , L. M. Ni 

Proceedings of the 2nd international conference on Supercomputing June 1988 
A new style of efficient parallel algorithms on distributed-memory multiprocessors is 
introduced, which exploits parallelism through pipelined parallel computation, or 
large-grain pipelining. By using macro-pipelining between nodes in the system, 
large-grain pipelining regulates the flows of data in the multiprocessor so that the 
degree of overlapping can be maximized and the effect of communication overhead 
can be minimized. To model pipelined parallel computations, an ana ... 



3 Code extension in ASCII ioo% 

S. Gorn 

Communications of the ACM October 1966 
Volume 9 Issue 10 

The American Standard Code for Information Interchange (ASCII) contains a number 
of control characters associated with the principle of code extension, that is, with the 
representation of information which cannot be directly represented by means of the 
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characters in the Code. The manner of use of these characters has not previously 
been completely described. This paper presents a set of mutually consistent 
philosophies regarding code extension applications, and s ... 

4 Comparison of two-dimensional FFT methods on the hypercube 99% 

□) c. Y. Chu 

Proceedings of the third conference on Hypercube concurrent computers and 

applications - Volume 2 January 1989 

Complex two-dimensional FFTs up to size 256 x 256 points are implemented on the 
Intel iPSC/System 286 hypercube with emphasis on comparing the effects of data 
mapping, data transposition or communication needs, and the use of distributed 
FFTs. Two new implementations of the 2D-FFT include the Local-Distributed method 
which performs local FFTs in one direction followed by distributed FFTs in the other 
direction, and a Vector-Radix implementation that is derived from decimating the 
DFT in two ... 



5 A modular systolic architecture for image convolutions 99% 

Eft K. Doshi , P. Varman 

Proceedings of the 14th annual international symposium on Computer 

architecture June 1987 

This paper describes a modular, systolic design for two-dimensional convolution 
which is a frequent and computationally intensive operation in low-level image 
processing. The design consists of a one-dimensional array of homogeneous cells, 
each with a fixed amount of storage. The paper also presents schema by. which the 
design consisting of a limited number of cells can be used to implement convolutions 
of varying kernel siz ... 



6 Application-specific memory management for embedded systems using 99% 
2) software-controlled caches 

Derek Chiou , Prabhat Jain , Larry Rudolph , Srinivas Devadas 
Proceedings of the 37th conference on Design automation June 2000 

We propose a way to improve the performance of embedded processors running 
data-intensive applications by allowing software to allocate on-chip memory on an 
application-specific basis. On-chip memory in the form of cache can be made to act 
like scratch-pad memory via a novel hardware mechanism, which we call column 
caching. Column caching enables dynamic cache partitioning in software, by 
mapping data regions to a specified sets of cache "columns" or "ways ... 



7 Dynamic object management for distributed data structures 98% 

□) B. K. Totty , D. A. Reed 

Proceedings of the 1992 ACM/IEEE conference on Supercomputing December 
1992 

8 An overview of query optimization in relational systems 97% 

Surajit Chaudhuri 

Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on 
Principles of database systems May 1998 
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1 Comparison of two-dimensional FFT methods on the hypercube 98% 

Q) C. Y. Chu 

Proceedings of the third conference on Hypercube concurrent computers and 

applications - Volume 2 January 1989 

Complex two-dimensional FFTs up to size 256 x 256 points are implemented on the 
Intel iPSC/System 286 hypercube with emphasis on comparing the effects of data 
mapping, data transposition or communication needs, and the use of distributed FFTs. 
Two new implementations of the 2D-FFT include the Local-Distributed method which 
performs local FFTs in one direction followed by distributed FFTs in the other direction, 
and a Vector-Radix implementation that is derived from decimating the DFT in two ... 



2 An overview of query optimization in relational systems 97% 

Surajit Chaudhuri 

Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on 
Principles of database systems May 1998 



3 Pipelined data parallel algorithms— concept and modeling 95% 

g) C.-T. King , W.-H. Chou , L M. Ni 

Proceedings of the 2nd international conference on Supercomputing June 1988 
A new style of efficient parallel algorithms on distributed-memory multiprocessors is 
introduced, which exploits parallelism through pipelined parallel computation, or 
large-grain pipelining. By using macro-pipelining between nodes in the system, large- 
grain pipelining regulates the flows of data in the multiprocessor so that the degree of 
overlapping can be maximized and the effect of communication overhead can be 
minimized. To model pipelined parallel computations, an ana ... 
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1 Large-grain pipelining on hypercube multiprocessors 

Q] C-T. King , L M. Ni 

Proceedings of the third conference on Hypercube concurrent computers and 

applications - Volume 2 January 1989 

A new paradigm, called large-grain pipelining, for developing efficient parallel 
algorithms on distributed-memory multiprocessors, e.g., hypercube machines, is 
introduced. Large-grain pipelining attempts to maximize the degree of overlapping 
and minimize the effect of communication overhead in a multiprocessor system 
through macro-pipelining between the nodes. Algorithms developed through large- 
grain pipelining to perform matrix multiplication are presented. To model the pi ... 



100% 



2 Warp experience: we can map computations onto a parallel computer ioo% 
efficiently 

H. T. Kung 

Proceedings of the 2nd international conference on Supercomputing June 1988 
Warp is a programmable, systolic array computer developed by Carnegie Mellon and 
produced by GE. A 10-cell Warp machine can perform 100 million floating-point 
operations per second (10 MFLOPS). A variety of applications have been mapped 
onto Warp. The experience has been that the mapping is not a real problem; in fact, 
usually near-optimal mapping is relatively easy to obtain, and the actual 
implementation of the mapping on the machine can often be automated. This paper 
explains why this is ... 



3 Pipelined data parallel algorithms— concept and modeling ioo% 

gj C.-T. King , W.-H. Chou , L. M. Ni 

Proceedings of the 2nd international conference on Supercomputing June 1988 
A new style of efficient parallel algorithms on distributed-memory multiprocessors is 
introduced, which exploits parallelism through pipelined parallel computation, or 
large-grain pipelining. By using macro-pipelining between nodes in the system, 
large-grain pipelining regulates the flows of data in the multiprocessor so that the 
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degree of overlapping can be maximized and the effect of communication overhead 
can be minimized. To model pipelined parallel computations, an ana ... 



4 Automatic data layout for distributed-memory machines ioo% 

Ken Kennedy , Ulrich Kremer 

ACM Transactions on Programming Languages and Systems (TOPLAS) July 1998 
Volume 20 Issue 4 

The goal of languages like Fortran D or High Performance Fortran (HPF) is to provide 
a simple yet efficient machine-independent parallel programming model. After the 
algorithm selection, the data layout choice is the key intellectual challenge in writing 
an efficient program in such languages. The performance of a data layout depends 
on the target compilation system, the target machine, the problem size, and the 
number of available processors. This makes the choice of a good layout extremel ... 



5 Compiler optimizations for Fortran D on MIMD distributed-memory ioo% 
machines 

Seema Hiranandani , Ken Kennedy , Chau-Wen Tseng 

Proceedings of the 1991 ACM/IEEE conference on Supercomputing August 1991 



6 Set-oriented production rules in relational database systems 99% 

k& Jennifer Widom , S. J. Finkelstein 

L " J ACM SIGMOD Record , Proceedings of the 1990 ACM SIGMOD international 
conference on Management of data May 1990 
Volume 19 Issue 2 

We propose incorporating a production rules facility into a relational database 
system. Such a facility allows definition of database operations that are 
automatically executed whenever certain conditions are met. In keeping with the 
set-oriented approach of relational data manipulation languages, our production 
rules are also set-oriented— they are triggered by sets of changes to the database 
and may perform sets of changes. The condition and action parts of our production 
rules may r ... 



7 Comparison of two-dimensional FFT methods on the hypercube 99% 

g) c. Y. Chu 

Proceedings of the third conference on Hypercube concurrent computers and 

applications - Volume 2 January 1989 

Complex two-dimensional FFTs up to size 256 x 256 points are implemented on the 
Intel iPSC/System 286 hypercube with emphasis on comparing the effects of data 
mapping, data transposition or communication needs, and the use of distributed 
FFTs. Two new implementations of the 2D-FFT include the Local-Distributed method 
which performs local FFTs in one direction followed by distributed FFTs in the other 
direction, and a Vector-Radix implementation that is derived from decimating the 
DFT in two ... 



8 Code extension in ASCII 99% 

S. Gorn 

Communications of the ACM October 1966 
Volume 9 Issue 10 

The American Standard Code for Information Interchange (ASCII) contains a 
number of control characters associated with the principle of code extension, that is, 
with the representation of information which cannot be directly represented by 
means of the characters in the Code. The manner of use of these characters has not 
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previously been completely described. This paper presents a set of mutually 
consistent philosophies regarding code extension applications, and s ... 



9 A modular systolic architecture for image convolutions 99% 

K. Doshi , P. Varman 

Proceedings of the 14th annual international symposium on Computer 

architecture June 1987 

This paper describes a modular, systolic design for two-dimensional convolution 
which is a frequent and computationally intensive operation in low-level image 
processing. The design consists of a one-dimensional array of homogeneous cells, 
each with a fixed amount of storage. The paper also presents schema by which the 
design consisting of a limited number of cells can be used to implement convolutions 
of varying kernel siz ... 

10 Solving narrow banded systems on ensemble architectures 99% 

S. Lennart Johnsson 

ACM Transactions on Mathematical Software (TOMS) September 1985 
Volume 11 Issue 3 

We present concurrent algorithms for the solution of narrow banded systems on 
ensemble architectures, and analyze the communication and arithmetic complexities 
of the algorithms. The algorithms consist of three phases. In phase 1, a block 
tridiagonal system of reduced size is produced through largely local operations. 
Diagonal dominance is preserved. If the original system is positive, definite, and 
symmetric, so is the reduced system. It is solved in a second phase, and the 
remaining varia ... 
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Parallel Processing Symposium, 1993., Proceedings of Seventh International , 
16 April 1993 
Pages:379 - 383 



fAbstractl [PDF Full-Text (376 KB^I ieee cnf 



3 Combining logic minimization and folding for PLAs 

Yu Chin Hsu; Youn Long Lin; Hang Ching Hsieh; Ting Hai Chao; 
Computers, IEEE Transactions on , Volume: 40 , Issue: 6 , June 1991 
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[Abstract! fPDF Full-Text (592 KB)1 ieee jnl 
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Data Engineering, 2002. Proceedings. 18th International Conference on , 26 I 
March 2002 
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7 Reduced-order realization of a nonlinear power network using 
companion-form state equations with periodic coefficients 
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Third International Symposium on , 18-20 Dec. 1997 
Pages:233 - 239 

fAbstract] fPDF Full-Text (512 KB^I ieee cnf 



io Fast parallel algorithms for solving triangular systems of linear 
equations on the hypercube 

Ibarra, O.H.; Kim, M.H.; 

Parallel Processing Symposium, 1991. Proceedings., Fifth International , 30 A 
May 1991 
Pages: 76 - 83 

fAbstractl fPDF Full-Text (580 KB^I ieee cnf 



n A 30M samples/s programmable filter processor 



h 



eee 



e eee g e ch e ch e 



e 



e 



g 



Search Results Page 3 of 3 




Golla, C; Nava, F.; Cavallotti, F.; Cremonesi, A.; Piacentini, P.; Casagrande, 
Campardo, G.; 

Solid-State Circuits Conference, 1990. Digest of Technical Papers. 37th ISSC( 
1990 IEEE International , 14-16 Feb. 1990 
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[Abstract! TPDF Full-Text (636 KB)1 ieeecnf 
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1 GEMM-based level 3 BLAS: high-performance model implementations 99% 
2) and performance evaluation benchmark 

Bo Kagstrom , Per Ling , Charles van Loan 

ACM Transactions on Mathematical Software (TOMS) September 1998 
Volume 24 Issue 3 

The level 3 Basic Linear Algebra Subprograms (BLAS) are designed to perform various 
matrix multiply and triangular system solving computations. Due to the complex 
hardware organization of advanced computer architectures the development of 
optimal level 3 BLAS code is costly and time consuming. However, it is possible to 
develop a portable and high-performance level 3 BLAS library mainly relying on a 
highly optimized GEMM, the routine for the general matrix multiply and add operation. 
With s ... 



2 Full-screen, scrollable APL2 spreadsheet input/output editor 99% 

□) Peter A. W. Lewis 

^ ACM SIGAPL APL Quote Quad March 1993 
Volume 23 Issue 3 

A full-screen, scrollable, spreadsheet-like editor based on IBM's APL2 32-bit 
interpreter for 386/486-based microcomputers is described. It is used for entering, 
examining, analyzing, editing and printing data. Mixed numeric and character arrays 
can be read in from or written out to formatted DOS files (ASCII) or comma-delimited 
DOS files. Alternatively, a bulk mode input facility allows for rapid direct data entry, or 
data can be entered, examined and edited cell-by-cell in the usual way. A fac ... 



3 Extensible file system (ELFS): an object-oriented approach to high 97% 
@) performance file I/O 

John F. Karpovich , Andrew S. Grimshaw , James C. French 

ACM SIGPLAN Notices , Proceedings of the ninth annual conference on Object- 
oriented programming systems, language, and applications October 1994 
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Volume 29 Issue 10 

Scientific applications often manipulate very large sets of persistent data. Over the 
past decade, advances in disk storage device performance have consistently been 
outpaced by advances in the performance of the rest of the computer system. As a 
result, many scientific applications have become I/O-bound, i.e. their run-times are 
dominated by the time spent performing I/O operations. Consequently, the 
performance of I/O operations has become critical for high performance in these 
applicatio ... 




The architecture of Montana: an open and extensible programming 96% 
environment with an incremental C++ compiler 

Michael Karasick 

ACM SIGSOFT Software Engineering Notes , Proceedings of the 6th ACM SIGSOFT 
international symposium on Foundations of software engineering November 1998 
Volume 23 Issue 6 

Montana is an open, extensible integrated programming environment for C++ that 
supports incremental compilation and linking, a persistent code cache called a 
CodeStore, and a set of programming interfaces to the CodeStore for tool writers. 
CodeStore serves as a central source of information for compiling, browsing, and 
debugging. CodeStore contains information about both the static and dynamic 
structure of the compiled program. This information spans files, macros, declarations, 
function bodies, ... 

5 A Functional Description of ANALYZE: A Computer-Assisted Analysis 95% 
Q) System for Linear Programming Models 

Harvey Greenberg 

ACM Transactions on Mathematical Software (TOMS) March 1983 
Volume 9 Issue 1 



6 Automating the re-declaration of unneeded globals as private 

Amitava Datta , Prabhaker Mateti 
— Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing: states 
of the art and practice March 1993 
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